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ABSTRACT. We give a complete description of the minimal intervals containing all fractional parts 
{£2 n }, for some positive real number £, and for all n > 0. 



1. Introduction 
In the paper [20] Mahler defined the set of Z-numbers by 

3\ n l 1 



e € R, £ > 0, Vn > 0, < U ( 2 ) | < 2 

where {z} is the fractional part of the real number z. Mahler proved that this set is at most countable. 
It is still an open problem to prove that this set is actually empty. More generally, given a real number 
a > 1 and an interval (x, y) C (0, 1) one can ask whether there exists £ > such that, for all 
n > 0, we have x < {£a n } < y (or the variant x < {£a n } < y). Flatto, Lagarias, and Pollington 
Ifl4l Theorem 1.4] proved that, if a = p/q with p, q coprime integers and p > q > 2, then any 
interval (x, y) such that for some £ > 0, one has that {C(p/q) n } 6 y) for all n > 0, must satisfy 
y — x > l/p. Recently Bugeaud and Dubickas [8] characterized irrational numbers £ such that for 
a fixed integer 6 > 2 all the fractional parts {£6 n } belong to a closed interval of length 1/6. Before 
stating their theorem we need a definition. 

Definition 1. Given two real numbers a and p with a > 0, we denote by s a p := (s a:P (n)) n >o and 

s 'a, P '■= ( s 'a, P ( n ))n>o the sequences defined by 

s a , P (n) = [in + l)a + p\ — [na + p\ and s' a p {n) = \{n + l)a + p] — \na + p] for n > 0, 

where [^J denotes the greatest integer < x and \x] denotes the least integer > x. The sequences 
s Q p and s' a are called Sturmian sequences if a is irrational, and periodic balanced sequences if a 
is rational. Furthermore, if a = p, these sequences are called characteristic Sturmian sequences or 
characteristic periodic balanced sequences, according to whether or not a is irrational. 

We observe that, if a is not an integer, then for all n > 0, 

[aj < s a , p (n) < |aj + 1 and \a\ - 1 < s' ap (n) < \a], 

where [a] — 1 = [a\ and [a] = [a\ + 1. On the other hand, if a is an integer, then s QjP = s' a 
and a < s a , P (n) < a + 1 for all n > 0. Accordingly, the sequences s atP and s' a take their values in 
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the "alphabet" {k, k + 1} where k = [a\ . The classical definition of Sturmian sequences with values 
in {0, 1} is thus obtained by subtracting [a\ from each of the terms in the sequences s a>p and s' . 
Alternatively, one may restrict a to the interval (0,1). Hereafter, if the alphabet is not mentioned, it is 
understood that the sequences are over {0, 1}. We may also assume that p G [0, 1) or p G (0, 1] since 
s a,p = s a ,p' and s' a p = s' a pl for any two real numbers p, p' such that p — p' is an integer. 

Example 2. Taking a = p = (3 — \/5)/2, we get the well-known (binary) Fibonacci sequence 
0100101001001010010100100101 

Remark 3. Note that if a is irrational, then the (Sturmian) sequences s a p and s' a are aperiodic (i.e., 
not eventually periodic), whereas if a is rational, the sequences s a>p and s' a are (purely) periodic. 
(See for instance |[T9l Lemma 2.14].) This justifies the use of "periodic" in the name of such sequences 
in the rational case. The reason for being called "balanced" is explained in Section |3~T1 

Let T denote the shift map on sequences, defined as follows: if s := (s n ) ra >o, then T(s) = 
T((s n ) n >o) := (s ra+ i) n >o- The main result in [8] reads as follows. 

Theorem 4 (Bugeaud-Dubickas). Let b > 2 be an integer and let £ be an irrational number. Then 
the numbers {£,b n } cannot all lie in an interval of length < 1/6. Furthermore there exists a closed 
interval I of length 1/6 containing the numbers {£6 n } for all n > if and only if the sequence of 
base b digits of the fractional part oft, is a Sturmian sequence s on the alphabet {k, k + 1} for some 
k G {0, 1, . . . , b — 2}. If this is the case, then £ is transcendental, and the interval I is semi-open. It 
is open unless there exists an integer j > 1 such that T 3 (s) is a characteristic Sturmian sequence on 
the alphabet {k, k + 1}. 

The purpose of this paper is to give a complete description of the minimal intervals containing all 
fractional parts {£2 n } for some positive real number £, and for all n > 0. More precisely, inspired 
by the definition of the lexicographic world (see Section |2!2l . let us define a function F on [0, 1] as 
follows. 

Definition 5. Vx G [0, 1], F{x) := inf{y G [0, 1], 3£ > 0, Vn > 0, x < {£2™} < y}. 

Remark 6. The function F is well defined since for y = 1 there exists such a real number £ (namely 
£ = 1). Note that this simple remark and Bugeaud-Dubickas' result for b = 2 imply the following two 
facts. 

• For x > \, one has Fix) = 1, i.e., £ = 1 is the only £ such that x < {£2 n } < 1 for all n > 0. 

• If £ be a positive irrational real number, then there exists a real number x such that all the 
fractional parts {£2 n } belong to the interval [x, x + \\ if and only if the base 2 expansion of 
£ is a Sturmian sequence. Furthermore, for any such x, one has F(x) = x + \. 

Before stating our main theorem, let us note that the sequences s a p and s' a (given in Definition [B 
are said to have slope a and intercept p, in view of their geometric realization as approximations to 
the line y = ax + p (called lower and upper mechanical words in [19, Chapter 2]). From now on, 
we will assume that a and p are in the interval [0, 1], in which case the sequences s Q p and s' a take 
their values in {0,1}. If a is irrational, then we have s aiCe = s' aa , denoted by c a . We also have 
s o,o = s o,o = 0°° an d 8 i,l = s 'i l = 1°°> denoted by cq and c\, respectively. In these cases, the 
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sequence c a is the unique characteristic Sturmian sequence of slope a in {0, 1} . On the other hand, 
if a G (0, 1) is rational, then the characteristic periodic balanced sequences of slope a, namely s a>a 
and s' aa , are distinct sequences in {0, 1} N containing both 0's and l's. More precisely, let us suppose 
that a = p/q G (0, 1) with gcd(p, q) = 1. Then by considering the prefix of length q of each of the 
sequences s p / q and s' p , Q , we find that there exists a unique word w p ^ q of length q — 2 such that 

s p /g,o = (0w P , g l)°° and s' p/q0 = (lu^O) 00 

where v°° denotes the periodic sequence vvvv • • • for a given word v. (See for instance lfT9l pg. 59].) 
Hence, the two characteristic periodic balanced sequences of slope p/q in {0, 1} N are given by 

s p / q ,p/q = T(s p/q>0 ) = K, 9 10)°° and s' p/q>p/q = T{s' p/qQ ) = (w Ptq 01)°°. 

The words w PA are often referred to as central words in the literature; they hold a special place in 
the rich theory of Sturmian sequences (see, e.g., lfl9l Chapter 2]). For instance, it follows from the 
work of de Luca and Mignosi [11, 12J that central words coincide with the palindromic prefixes of 
characteristic Sturmian sequences (see Section 1331) . 

Given a sequence s G {0, 1} N , let r(s) denote the real number whose sequence of base 2 digits is 
given by s. Our main number-theoretical result reads as follows. 

Theorem 7. Let x be a real number in [0,1]. 

(i) Ifx > \, then F(x) = 1. 

(ii) Ifx = 0, then F(x) = 0. 

(iii) If x < | and if the base 2 expansion of^ is given by a characteristic Sturmian sequence, then 
F(x) = x + ^. Furthermore, F(x) is the unique real number in [0, 1] that has a Sturmian 
base 2 expansion and satisfies x < {F(x)2 k } < F(x)for all k > 0. 

(iv) Ifx < ^ and the base 2 expansion of^ is given by a characteristic periodic balanced sequence 
of slope p/q G (0, 1) with gcd(p,q) = 1, then F(x) is the rational number whose base 2 
expansion is given by the periodic balanced sequence s ' p / q Q = (lwp,<?0)°°, in which case 
F(x) < x + \. 

(v) In all other cases, F{x) can be explicitly computed: it is equal to the rational number whose 
base 2 expansion is given by a (unique) periodic balanced sequence s ' p / q0 = (l^p,(j0)°° 
where p, q are coprime integers with < p < q such that r((w Pjg 01)°°) < | < 
r((wp tq W)°°). In these cases, F(x) < x + I- 

Moreover, in cases (iv) and (v), F(x) is the unique real number in (0, 1) whose base 2 expansion is 
given by a periodic balanced sequence and which satisfies x < {F(x)2 k } < F(x) for all k > 0. 

Remark 8. It is known (see ||T3l ) that real numbers having a Sturmian base 2 expansion are transcen- 
dental. As a consequence of Theorem[7J we deduce that, if x is an algebraic real number in [0, |], then 
F(x) is rational. 

2. The combinatorial approach 

The main tool used by Bugeaud and Dubickas is combinatorics on words: real numbers are replaced 
by their base b expansion, and inequalities between real numbers are transformed into (lexicographic) 
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inequalities between infinite sequences representing their base b expansions. We will establish a the- 
orem of combinatorial flavour (Theorem [T3l . whose translation into a number-theoretical statement 
is exactly Theorem [7j above. A method for computing F(x) in Case (v) of Theorem [7j is given in 
Section [6l 

2.1. Two combinatorial theorems. It happens that the case b = 2 of Bugeaud-Dubickas' theorem 
was already proved by Veerman in l25l l26l . The combinatorial result proved by Veerman, and by 
Bugeaud-Dubickas is stated (and strengthened) in Theorems l9l and [TOlbelow. 

Theorem 9. An aperiodic sequence s := (s n ) n >o on {0, 1} is Sturmian if and only if there exists a 
sequence u := (u n ) n >o on {0, 1} such that Ou < T k (s) < lufor all k > 0. Moreover, u is the 
unique characteristic Sturmian sequence with the same slope as s, and we haveOu = inf{T fc (s), k > 
0} and lu = sup{T k (s), k > 0}. 

Theorem 10. An aperiodic sequence u on {0,1} is a characteristic Sturmian sequence if and only if 
for all k > 0, 

Ou < T k (u) < lu. 

Furthermore, we have Ou = inl{T k (u), k > 0} and lu = sup{T k (u), k > 0}. 

2.2. The lexicographic world. As discussed in [1], the results in Theorems l9l and [TOl have been 
rediscovered several times since the work of Veerman in the mid-late 80's. One of the presentations 
of these statements is due to Gan lfl6l . It is based on the lexicographic(al) world, which seems to have 
been introduced in 2000, in a preprint version of [18 ]. 

For any two sequences x, y <E {0,1} N , define the set 

S x , y ■= {s € {0, l} n , \/k > 0, x < T k (s) < y}, 

where < denotes the lexicographic order on {0, 1} N induced by < 1. The lexicographic world C is 
defined by 

£:={(x,y)G{0,lfx{0,l} N , £ x , y ^ 0}. 
Moreover, by [16, Lemma 2.1], we have 

£ = {K")e{o,fx{o,i}%>M, 

where (f> : {0, 1} N -» {0, 1} N is the map defined by 

4>(x) :=inf{y G {0, if, S x>y £ 0}. 

Trivially, (j>(lx) = 1°° = 111 • • • for any sequence x G {0, 1} N . 

In lfl6l . Gan showed that for any sequence u G {0, 1} N , the set £ou,lu is not empty, i.e., there 
exists a sequence s G {0, 1} N such that Ou < T k (s) < lu for all k > (see |[T6l Lemma 4.2]). Fur- 
thermore, the sequence <fi(0u) has the foregoing property (by [ 16, Theorem 3.4]) and it is a Sturmian 
or periodic balanced sequence with the property that T k (cf)(0u)) < 4>(0u) for all k > (see |[T6l 
Theorem 4.6]). Moreover, by |[T6l Lemma 5.4], the set Ho u ,i u contains a unique Sturmian or periodic 
balanced sequence satisfying T k (s) < s for all k > 1. We deduce from these remarks that, for any 
sequence s G {0, 1} N , if s = (p(0u) for some sequence u G {0, 1} N , then s is the unique Sturmian or 
periodic balanced sequence satisfying Ou < T k (s) < lu and T k (s) < s for all k > 0. The converse 
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of this statement also holds by [16, Corollary 5.6]. These observations establish Gan's main theorem 
(see below), which shows in particular that any element in the image of <fi is a Sturmian or periodic 
balanced sequence in {0, 1} N (and such sequences are the lexicographically greatest amongst their 
shifts). 

Theorem 11. II 1 61 Theorem 1.1] For any sequence s G {0, 1} N , the following conditions are equiva- 
lent. 

(i) s = 4>(0u) for some sequence u G {0, 1} N . 

(ii) s is the unique Sturmian or periodic balanced sequence satisfying Ou < T k (s) < lu and 
T k (s) < sforallk> 0. 

Note. "Sturmian" in Gan's paper corresponds to what is called here (and classically) "Sturmian or 
periodic balanced". 

Remark 12. It is well known that the closure of the shift-orbit of a characteristic Sturmian sequence s 
(i.e., the closure of {T k (s), k > 0}, denoted by O(s)) is precisely the set of all Sturmian sequences 
having the same slope as s (see for instance [19, Propositions 2.1.25 and 2.1.18], or [21]). In view of 
this fact, Gan's result can be strengthened using Theorem [9[ as follows. If the sequence s := <fi(0u) 
is Sturmian, then u is the unique characteristic Sturmian sequence in O(s), in which case s = lu. 

We will further strengthen Gan's result by describing cj)(0u) for any given sequence u G {0, 1} N . 
In particular, we will show that when u contains both 0's and l's and is not a characteristic Sturmian 
sequence, there exists a unique pair of characteristic periodic balanced sequences s and s' of (rational) 
slope p/q G (0, 1) with gcd(p, q) = 1, such that s' < u < s, in which case <p(0u) = Is'. Moreover, 
the sequences s, s' can be explicitly determined in terms of u. 

With the same notation as in the Introduction, our main combinatorial theorem reads as follows. 

Theorem 13. Let ube a sequence in {0, 1} N . 

(i) (f>(lu) = 1°°. 

(ii) Ifu G {0°°, 1°°}, then 4>(0u) = u. 

(iii) If u is a characteristic Sturmian sequence, then <p({)u) = lu. Furthermore, lu is the unique 
Sturmian sequence in {0, 1} N satisfying 0u < T k {lu) < lu for all k > 0. 

(iv) If u is a characteristic periodic balanced sequence of rational slope p/q G (0, 1) with 
gcd(p,g) = 1, then 0(Ott) = s' p/q Q = (lwp^O) 00 . 

(v) If u does not take any of the forms given in parts (ii)-(iv), then there exists a unique pair of 
coprime integers p, q with < p < q such that (wp^Ol) 00 < u < (wp^W) 00 , in which case 
0(Oti) = s' p/qfi = {lw m 0)°°. 

Moreover, in cases (iv) and (v), 4>(0u) is the unique periodic balanced sequence in {0, 1} N satisfying 
Ou < T k (4>(0u)) < <p(0u)forall k > 0. 

In the next section, we will recall some generalities about Sturmian and periodic balanced se- 
quences. (For more on Sturmian sequences, the reader can consult, e.g., |[T9l Chapter 2].) The proof of 
Theorem [T3lis given in Section[4j and a corollary is stated in Section[5] Lastly, in Section[6l we show 
how to determine the "central word" w Pjq such that <f>(0u) = (lwp^O) 00 for any "generic" sequence 
u falling into Case (v) of Theorem [131 above. 
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3. STURMIAN & PERIODIC BALANCED SEQUENCES 

In what follows, we will use the following notation and terminology from combinatorics on words 
(see, e.g., [19]). Let w = x\x% • • • x m be a word over a finite non-empty alphabet A (where each 
Xi is a letter in A). The length of w, denoted by \w\, is equal to m. The empty word is the unique 
word of length 0, denoted by e. The number of occurrences of a letter x in w is denoted by \w\ x . The 
reversal of w is defined by w = x m ■ ■ ■ x^X\, and by convention e = e. If w = w, then w is called 
a palindrome. An integer £ > 1 is said to be a period of w if, for all i, j with I < i,j < m, i = j 
(mod £) implies x% = Xj. Note that any integer £ > \ w\ is a period of w with this definition. The word 
w is said to be primitive if it is not a power of a shorter word, i.e., if w = u n implies n = 1. A finite 
word z is said to be a factor of w if z = a^Xj+i ■ • • Xj for some i,j with 1 < i < j < m. Similarly, a 
factor of a sequence s := so s i s 2S3 • • • is any finite word of the form SjSj + i • • • Sj with i < j. 

Recall from the Introduction that the shift map T is defined on sequences as follows: if s := 
{s n )n>o then T(s) = T((s n ) n >o) := (s n +i)n>o- This operator naturally extends to finite words as a 
circular shift by defining T(xw) = wx for any letter x and finite word w. 

Under the operation of concatenation, the set ^4* of all finite words over A is a free monoid with 
identity element e and set of generators A. If x is a letter, then we use x* to denote {x}*, the set of all 
finite powers of x. From now on, all words and sequences will be over the alphabet {0, 1}. 

3.1. Balanced sequences. All Sturmian sequences are "balanced" in the following sense (see for 
instance ll22l [TU1 151 l6l [191 ). 

Definition 14. A finite word or sequence w over {0, 1} is said to be balanced if, for any two factors 
u, v of w with \u\ = \v\, we have | \u\i — \v\i\ < 1 (or equivalently \\u\o — \v\o\ < 1). 

Recall from Remark[3]that Sturmian sequences are aperiodic. Morse, Hedlund, and Coven ll22l[T0l 
proved that the Sturmian sequences are precisely the aperiodic balanced sequences on two letters 
(also see ||T9l Theorem 2.1.3]). "Periodic balanced sequences" (as specified in Definition [T]) are also 
balanced in the sense of the above definition (which justifies their name); moreover, they constitute 
the set of all periodic balanced sequences on two letters (see fT9l Lemma 2.1.15] or |[24l ). 

3.2. Characteristic Sturmian sequences. In ifTTI . characteristic Sturmian sequences were character- 
ized using iterated palindromic closure, defined as follows. The palindromic ( right- )closure of a finite 
word w, denote by w^ + \ is the (unique) shortest palindrome beginning with w. That is, if w = uv 
where v is the longest palindromic suffix of w, then ?z/ + ) := uvu. For example, (011) (+) = 0110. 
The iterated palindromic closure function, denote by Pal, is defined by iteration of the palindromic 
right-closure operator (see, e.g., IfTTI ). More precisely, Pal is defined recursively as follows. Set 
Pal(e) = e, and for any word w and letter x, define Pal(wx) := (Pal(w)x)^ . For example, 
PaZ(Oll) = (Pa/(01)1)M = (0101)W = 01010. Note that Pal is injective; and moreover, it is 
clear from the definition that Pal(w) is a prefix of Pal{wx) for any word w and letter x. Hence, if v 
is a prefix of w, then Pal(v) is a prefix of Pal(w). 

The following theorem provides a combinatorial description of characteristic Sturmian sequences 
in terms of Pal. 
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Theorem 15. jll] For any sequence s G {0, 1} N , the following properties are equivalent. 

(i) s is a characteristic Sturmian sequence. 

(ii) There exists a (unique) sequence A := xoX±X2X 3 . . . G {0, 1} N \ ({0, 1}*0°° U {0, 1}*1°°) 
(i.e., not eventually constant), called the directive sequence of s, such that 

s = lim Pal(xQX\X2 ■ ■ ■ x n ) = Pal(A). 

n^oo 

Example 16. Recall from Example |2] that the (binary) Fibonacci sequence / = 01001010010 • • • is 
the characteristic Sturmian sequence c a with a = (3 — V5)/2; it has directive sequence (01)°°. That 
is: 

/ = Pa/(0101 • • • ) = 01001010010 
where the underlined letters indicate at which points palindromic closure is applied. Note the simple 
continued fraction expansion of a = (3 — \/5) /2 is [0; 2,1,1,1,...]. More generally, if a £ (0, 1) is 
an irrational number with simple continued fraction expansion [0; d\ + 1, g?2> ^3, d^, . . .} where d\ > 
and all other d { > 1, then c a = Pal(0 dl l d ^0 d n di ■■■) (see ESI El and also [2, pg. 206]). 

3.3. Characteristic periodic balanced sequences. We will now recall some known combinatorial 
descriptions of the characteristic periodic balanced sequences in {0, 1} N (see Proposition [T71 and Re- 
mark[T8lbelow). 

Let us first recall from the Introduction that the characteristic balanced sequences of slopes and 1 
are c$ = 0°° and c\ = 1°°, respectively. For all other rational slopes p/q G (0, 1) with gcd(j>, q) = 1, 
there are exactly two characteristic periodic balanced sequences of slope p/q, given by 

s p /q,p/q = T ( s p/ q ,o) = (w P ,qW)°° and s' p/qp/q = T(s' p/qQ ) = (u^Ol) 00 , 

where w p ^ q is a word of length q — 2 in {0, 1}*. For example, with p = 2 and q = 5, we obtain the 
following two characteristic periodic balanced sequences of slope 2/5: 

«2/5,2/5 = (01010)°° and 4/5,2/5 = (01001)°° where w 2 , 5 = 010. 

Notice that ^2,5 is a palindrome and |w2,5l0|i = |w2,501|i = 2 = p. More generally, one can verify 
that all words w Pjq are palindromes and Itt^lOli = | tt?p )(? 01 1 1 = p. Furthermore, the words w Pjq 10 
and w Pt q01 (which have length q) are primitive since gcd(p, q) = 1. Hereafter, the word w p ^ q will be 
called the central word of slope p/q; it is the unique central word of length q — 2 containing p — 1 
occurrences of 1. 

Note. The set of all central words of slope p/q £ (0, 1) (where p, q are coprime integers) coincides 
with the family of "central words" in {0, 1}* as defined in (19[ Chapter 2] (in particular, see |fl9l 
Theorem 2.2.11 and Proposition 2.2.12]). 

The following proposition collects together some equivalent definitions of central words. For many 
more, see the nice survey HI . 

Proposition 17. For any word w £ {0, 1}*, the following properties are equivalent. 

(i) w is a central word. 

(ii) 0u>l and lwO are balanced 1121 . 

(iii) w = Pal(v) for some word v G {0, 1}* iflUfTII 
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(iv) w has two periods £, m such that gcd(^, m) = 1 and \w\ = I + m — 2 111011121 . 

(v) M)S0*Ul*U(Fn P10P) where P is the set of all palindromes in {0, 1}* lH2l 

(vi) w G 0* U 1*, or there exists a unique pair of words w\, W2 G {0, 1}* such that w satisfies the 
equation w = w\Qlw2 = u^lOwi lTT2l ITTTl . 

Moreover, in part (vi), w\ and W2 are central words, i\ := \w\\ + 2 and £2 '■= \w2\ +2 are coprime 
periods of w, and min{^i, I2} is the minimal period ofw Q. 

Note. P n (PWP) =PD (P01P). 

Furthermore, by lPT9l Proposition 2.2.12], the central word w p>q of slope p/q is the central word 
with coprime periods I, m where I + m = q and mp = 1 (mod q). For example, the central word 
W2,5 = 010 has coprime periods £ = 2 and m = 3 where 2 + 3 = q and mp = 6 = 1 (mod 5). Also 
note that w\ >q = q ~ 2 and w q -i tq = l q ~ 2 ; in particular w\ t 2 = £• 

Remark 18. Let p, q be coprime integers with < p < q. Then p/q has two distinct simple continued 
fraction expansions: 

p/q=[0;d 1 + l,...,d n ,l] = [0;d 1 + l,...,d n + l] 

where d\ > and all other di > 1. It is known (see, e.g., (4[ Proposition 27]) that the word v £ {0, 1}* 
such that Wp^ q = Pal(v) takes the form v = dl l d2 ds • • • x dn where x = if n is odd and x = 1 if 
n is even. For example, 2/5 = [0; 2, 1, 1] = [0; 2, 2] and W2$ = 010 = PaZ(01). Moreover, as in the 
case of characteristic Sturmian sequences (see Theorem [15] and Example O, the two characteristic 
periodic balanced sequences of slope p/q can be obtained by iterated palindromic closure. More 
precisely, with the above notation, we have 

(u>p,gxy)°° = Pal(0 dl l d2 ■ ■ ■ x dn+l y°°) and (w^yx) 00 = Pal(0 dl l d2 ■ ■ ■ x dn yx°°) 

where {x,y} = {0,1}. For example, 

(-u^slO) 00 = (01010) 00 = PaZ(0110°°) and (w 2 ,501)°° = (01001) 00 = Pa/(0101°°). 

In ll23ll Pirillo proved that a word w G {0, 1}* is a palindromic prefix of some characteristic Stur- 
mian sequence in {0, 1} N , i.e., w = Pal(v) for some v G {0, 1}* (see Theorem IT3T> if and only if u;01 
is a circular shift of wlO. From this fact and Proposition [T71 we thus deduce the following result. 

Proposition 19. A word w G {0, 1}* is central if and only ifwOl is a circular shift ofwlO. 

Consequently, for any two coprime integers p, q with < p < q, the two characteristic periodic 
balanced sequences of slope p/q, namely s p / q ,p/q = (wp^W) 00 and s' p , qp , q = (wp^0l)°°, are shifts 
of each other, and therefore they have the same set of factors. 

Remark 20. Recall from Remark [12] that the closure of the shift-orbit of a characteristic Sturmian 
sequence s is precisely the set of all Sturmian sequences having the same slope as s. Moreover, the 
(Sturmian) sequences in 0(s) are exactly the sequences that have the same set of factors as s (see 
for instance |[T9l Propositions 2.1.25 and 2.1.18], or |[2Ti ). Likewise, the shift-orbit of a characteristic 
periodic balanced sequence u consists of all the periodic balanced sequences with the same set of 
factors (and also the same slope) as u. However, in contrast to the aperiodic case, we deduce from 
Proposition [T9l that, if u is a characteristic periodic balanced sequence containing both 0's and l's, 
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then 0(u) contains exactly two distinct characteristic periodic balanced sequences (not just one), 
which take the form (w01)°° and (wW)°° where w is the central word having the same slope as u. 

The following useful result is due to de Luca ifTTTl ; in particular, see HJJ Remark 1 and Proposi- 
tion 9] and also \9, Lemma 5]. 

Proposition 21. Let w be a central word in {0, 1}*. If w = W\01w2 = u^lO^i where w\ and w 2 are 
(central) words, then 

Pal(w0) = (u>0) (+) = w 2 Ww l 01w 2 and Pal(wl) = (wl) (+) = lOiOl^lO^i. 

We end this section with a result (Corollary [23]below) that will be particularly useful in the proof of 
our main combinatorial theorem. Let us first recall that a non-empty finite word v over an alphabet A 
is said to be a Lyndon word (resp. anti-Lyndon word) if v is a primitive word that is lexicographically 
less (resp. lexicographically greater) than all of its circular shifts with respect to a given total order 
on A. 

Proposition 22. [3, Theorem 3.2 and Corollary 3.1] A non-empty finite wordv G {0, 1}* is a balanced 
Lyndon word (resp. balanced anti-Lyndon word) with respect to the lexicographic order if and only if 
v = Owl (resp. v = lw0)for some central word w G {0, 1}*. 

As a direct consequence of the above proposition, we have the following result. 

Corollary 23. For any central word w G {0, 1}*, the (primitive) words Owl and lwO are the lexico- 
graphically least and greatest words amongst their circular shifts. 

4. Proof of Theorem [T3l 

- Assertions (i) and (ii) are straightforward (see [16, Lemma 2.4]). 

In order to prove the other assertions, we first recall the inequalities that are equivalent to s = cj)(0u) 
from Theorem [TT] 

(1) Ou < T k (s) < lu and T k (s) < s for all k > 0. 

- Assertion (iii) is a consequence of Gan's result (Theorem ITTb together with Theorem [9] (see 
Remark[l2]>. 

- We will now prove Assertions (iv) and (v). Suppose that u is not a characteristic Sturmian se- 
quence and that u contains both 0's and l's. Then we know from Theorem [TT] that s := (f)(0u) is 
a periodic balanced sequence satisfying the inequalities in (Q]). Indeed, s cannot be Sturmian, for 
otherwise u would be a characteristic Sturmian sequence by Remark [T2l 

By Remark[20l 0(s) contains exactly two distinct characteristic periodic balanced sequences, given 

by 

soi := (w0i)°° and s w := (u>10)°° 

where w G {0, 1}* is the central word with the same slope as s. We now deduce from Corollary 1231 
that the lexicographically least sequence in 0(s) is 

(Owl) 00 = 0(u>10)°° = Osio 
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and the lexicographically greatest sequence in O(s) is 

(lu>0)°° = 1(^01)°° = lsoi- 

Hence, 

(2) 0s w < T k (s) < Isqi for all k > 0. 

Moreover, since s is the lexicographically greatest sequence in its shift-orbit (by the second inequality 
in (fl}), we have s = (lw0)°° = ls i. 

We will now show that soi < w < sio- Since Osio and lsoi are the lexicographically least and 
greatest elements in O(s), the inequalities in (Q]) imply that 

Ou < Osio and lsoi < lw. 

Hence s 01 < u < s 10 ; that is, (w01)°° <u< (wW)°°. 

Furthermore, we note that there does not exist another central word z such that (z01)°° < u < 
(zl0)°°. For if so, then the set [Ou, lu] := {s G {0, l} N ,0u < s < lu} would contain the peri- 
odic balanced sequences O(zlO) 00 = (Ozl) 00 and l(z01)°° = (lz0)°°, and hence all of the shifts 
of the characteristic periodic balanced sequence (z01)°°, since the former two sequences are the 
lexicographically least and greatest sequences in the shift-orbit of (z01)°° (by Proposition [19] and 
Corollary |23T). But by |[T6l Lemma 5.4], the set [Ou, lu] contains a unique periodic balanced shift- 
orbit. Therefore, since O((w01)°°) C [Ou, lu], we must have z = w. We have thus established the 
following lemma. 

Lemma 24. Suppose u is a sequence in {0, 1} N \ {0°°, 1°°} that is not characteristic Sturmian. 
Then there exists a unique central word w £ {0, 1}* such that (w01)°° < u < (wl0)°°. Moreover, 
4>(0u) = (lw0)°°. 

Assertions (iv) and (v) are direct consequences of the above lemma, and the last statement in the 
theorem follows from Theorem [TTJ □ 

5. A COROLLARY 

By Theorem \15\ the set of all characteristic Sturmian sequences in {0, 1} N is given by 

S = {u G {0, 1} N , 3v G {0, 1} N \ ({0, 1}*0°° U {0, , u = Pal{v)}. 

And it follows from Proposition [T7] that the set of all characteristic periodic balanced sequences in 
{0, 1} N is given by 

p = {0 00 ,l 00 }UPoiUPio 

where 

Vqi := {u G {0, 1} N , 3v G {0, 1}*, u = (Pal(v)Ol) 00 } 

and 

p w ■= { u G {0, 1} N , 3v G {0, l}*,u= (PaZ(v)lO) 00 }. 

Given a characteristic periodic balanced sequence in {0, 1} N of the form s := (Pal(v)xy) 00 where 
v G {0, 1}* and {x, y} = {0, 1}, we let s denote the other characteristic periodic balanced sequence 
in the shift-orbit of s, i.e., s := (Pal(v)yx) 00 (see Remark l20l). 
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As an immediate consequence of Theorem [T3l we obtain the following description of the lexico- 
graphic world. 

Corollary 25. We have C = {(01°°, 1°°)} U C U d U £ i U Ao U £* where 



C 


= {(0°° 




Ci 


= {(1", 


l°°),ti€{0,l} N }, 


Coi 


= {(0u, 


v) G 0(5 U Voi) x {0, 1} N , v > lu}, 


Cio 


= {(0u, 


v) G OPio x {0,1} N , v > lu}, 


r 

if; 


= {(0u, 


v) G ({0, 1} N \ 0(5 U V)) x {0, 1} N , 3s G P Q1 , s < u < s, v > Is} 



6. HOW TO DETERMINE 0(Ou) FOR A GENERIC SEQUENCE U 

The following theorem provides a method for determining the central word w such that 0(Ow) = 
(1^0)°° for any "generic" sequence u G {0, 1} N falling into Case (v) of Theorem [T3l Hereafter, a 
prefix of u that is a central word is called a central prefix of w. 

Theorem 26. Suppose u is a sequence in {0, 1} N \ {0°°, 1°°} that is neither a characteristic Sturmian 
sequence nor a characteristic periodic balanced sequence. Let v be the longest central prefix of u. 
Then v is finite (v ^ e), and (f)(0u) is determined as follows. 

(i) Ifv = l k for some k > 1, then cp(0u) = (l k 0)°° = (lw Pjq 0)°° where p = k and q = k + 1. 

(ii) Ifv = k for some k > 1, then cp(0u) = (10 fc )°° = (IWp^O) 00 where p = 1 and q = k + 1. 

(iii) Suppose v contains both 0's and l's. Let v\, vi be the unique pair of central words such that 
v = V1OIU2 = V2WV1 where £\ := \v\\ +2 and £2 '■= \v2\ + 2 are coprime periods of v. 
Consider the prefix of length 2\v\ + 4 of u, namely the prefix vxyz where x, y G {0, 1} and 
\z\ = \v\ + 2. 

(a) If either xy = 01 and z > vOl, or xy = 10 and z < vlO, then (j)(Qu) = (lf0)°° = 
(lwp !q 0)°° where p = \v |i + 1 and q = \v\ + 2 = £\ + £ 2 . 

(b) If either xy = 01 and z < vOl, or xy = 00, then (j)(0u) = (lf 2 0)°° = (lvjp^O) 00 where 

P = \v2\1 + 1 tfwd 9 = £-2- 

(c) If either xy = 10 dfirfz > i>10, orxy = 11, then (j)(0u) = (lt>i0)°° = (lw P) q0)°° where 

P = + 1 Q = Pi- 
Note. In Assertion (iii), it cannot happen that z = vxy when x / y. For instance, if xy = 01 and 
z = vOl, then it would begin with the following word: 

uOlvOl = u 2 10wi01^ 2 10ui01 

where the prefix f2l0fi01f2 = v01v2 is a central word, by Propositions [171 and |2~T1 But then u has a 
central prefix longer than v; thus z / f 01. Similarly, if xy = 10, then 2 / vlO. 

The following lemma is needed for the proof of Theorem [26] 

Lemma 27. Suppose v is a central word in {0, 1}* \ (0* U 1*). Let v\, V2 be the unique pair of central 
words such that v satisfies the equation v = V1OIV2 = t^lO-ui. Then f01t>2 (resp. vlOvi) is a prefix 
of the characteristic periodic balanced sequence (W2IO) 00 (resp. (fi01)°°J. 



12 



JEAN-PAUL ALLOUCHE AND AMY GLEN 



Note. By Propositions [XT] and EH the words vQlv2 and vlQv\ are central words since uOlt^ = 
Pal{vO) and vWv% = Pal(vl). 

Proof of Lemma \27\ We will prove only that (v 2IO) 00 begins with the central word Pal(vO) = v01v2, 
since the proof of the other case is very similar. 

By Proposition fTTl li := |wi| + 2and^2 := I V2 I + 2 are coprime periods of v where \v\ =£±+£2 — 2. 
In particular, since £2 = \v2l0\ is a period of v with £2 < \v\, there exists an integer k > 1 such that 
v = (V2W) v' 2 where v 2 is a (possibly empty) prefix of V2IO, in which case v\ = (V2IO) since 
v = i>2l0i>i. Moreover, since v is a palindrome, v 2 is a prefix of v, and therefore v' 2 = v 2 , i.e., u 2 
is a palindrome. Furthermore, u 2 01 is a prefix of v since its reversal 10t> 2 is a suffix of u. We will 
now show that the central word Pal(vO) = v01v2 is a prefix of the characteristic periodic balanced 
sequence (t^lO) 00 by considering five different cases according to the length of the palindrome v 2 . 

(1) If v 2 = U2IO, then v = (t>2i0) fc+1 and v\ = (v2l0) k . Hence, v being a palindrome implies 
that V2W is a palindrome and we have v = (v2l0) k+1 = (01v2) k+1 . Therefore, 

(V2W)°° = (v 2 W) k+1 OlWafalO) 00 = W01U2M0) 00 . 
v 2 10 

Thus, the central word Pal(v0) = v01v2 is a prefix of (t^lO) 00 . 

(2) If v 2 = U2I, then since u 2 and t>2 are palindromes, we have V2I = 1«2 , and hence t>2 is a 
power of 1; in particular, V2 = 1^ 2 ~ 2 . Therefore 

( V2 10)°° = (^lO^lttoalOfalO) 00 

= (7J2lO) fc w 2 1011^0(u2lO) 00 

v lv z 

= uOl^O^lO) 00 
= Pal{v0)0(v2l0)°°. 

(3) If f 2 = V2, then t> = (i^lO)^, and therefore v\ = (i^lO)^ -1 ~v%. But this implies that 
l\ = k£2, which is impossible since £\ and £2 are coprime integers greater than 1. 

(4) If V2 = v' 2 0, then since v 2 and v' 2 are palindromes, we have v' 2 = 0v' 2 . Therefore v' 2 (and 
hence V2) is a power of 0; in particular, V2 = f/ 2-2 . Thus 

( U2 io)°° = mo^io^ioMo) 00 

= (u 2 10)^01 u 2 10(u 2 10)°° 

V 0V2 
= U01W2010(V210)°° 

= Pal(vO)OW(v 2 W)°° . 
Note that we cannot have V2 = v' 2 l because v' 2 01 and V2 are both prefixes of v. 

(5) If \v' 2 \ < I 1 — 2, then V2 = v' 2 0lv 2 for some (possibly empty) word v 2 G {0, 1}*, in which 
case i> = (f 2 01?j 2 10) fc t> 2 . (Note that neither v' 2 l nor v 2 00 is a prefix of V2 because v' 2 01 and V2 
are both prefixes of v.) Since v is a palindrome that begins with the palindrome V2 = v' 2 01v 2 
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and therefore ends with V2 = t>2 = v'^Wv^, we see that v'^lOv^ = v'^Wv^. Hence v' 2 ' is 
a palindrome. Moreover, v' 2 ' is a central word since v' 2 ' is a palindromic prefix (and also a 
palindromic suffix) of the central word V2 and any palindromic prefix (or suffix) of a central 
word is central (see lfl2l or |[T9l Corollary 2.2.10]). Thus, by Proposition [T71 V2 satisfies the 
equation V2 = v'^Wv^ = v^Olv'^. Hence, we have 

(^10)°° = (va^S^OKlOualO^lO) 00 

v 2 10 

= (v 2 10) fc t> 2 01 Ua'lO^Ol v' 2 ' 10(u 2 10)°° 
V201 

= vOl^Ol^lO^lO) 00 
= Pal(v0)01v'2 f 10(v2l0)°°. 

In all of the above cases (with the exception of the impossible case (3)), we have shown that the 
central word Pal{vQ) = i>01?; 2 is a prefix of (i^lO) 00 , as required. □ 

Proof of Theorem \26\ Suppose u is a sequence in {0, 1} N \ {O 00 , 1 00 } that is neither a characteristic 
Sturmian sequence nor a characteristic periodic balanced sequence. Then the longest central prefix of 
u, say v, is non-empty since it could (at the very least) be a letter. Furthermore, v is finite; otherwise, if 
v were infinite, then u would be either a characteristic Sturmian sequence or a characteristic periodic 
balanced sequence (see Theorem [15] and Remark [T8l). 

We know from Theorem [TBI (or Lemma 1241) that there exists a unique central word w G {0, 1}* 
such that (u>01)°° < u < (wl0)°°, in which case </>(0it) is equal to the periodic balanced sequence 
(lwO) 00 . We will show how to determine w in terms of the longest central prefix v. Note that w is 
either empty or a (palindromic) prefix of v, by the maximality of v. 

First suppose that v = x k for some x G {0, 1} and k > 1. Then by the maximality of v as a central 
prefix of u, it follows that u begins with x k y = vy where y G {0, 1}, y ^ x. Moreover, the prefix of 
length 2k + 1 of u takes the form x k yu where \u\ = k and \u\ x < k — 1; otherwise u would begin 
with x k yx k = Pal{x k y), contradicting the fact that v (= Pal(x k )) is the longest central prefix of it. 
If x = 1, then we easily see that 

(l^Ol) 00 < u (= l k 0u ■■■)< (l fc 0)°°, 

where the latter inequality follows from the fact that |u| = k and u < l k (since u contains at most 
k — 1 occurrences of the letter 1). Hence by LemmaHl 0(Ou) = (l fe 0)°° = (lw Pjq 0)°° where p = k 
and q = k + 1. Similarly, if x = 0, we have 

(0 fc l)°° <u(= k lu ■■■) < (O^IO) 00 , 

where the first inequality follows from the fact that \u\ = k and k < u (since u contains at most 
k — 1 occurrences of the letter 0). Hence by LemmaHfl <p(0u) = (W k )°° = (lw p , q 0)°° where p = 1 
and q = k + 1. We have thus proved Assertions (i) and (ii) of the theorem. 

Now suppose that the longest central prefix v of u contains both 0's and 1 's. Then by PropositionfTTl 
there exists a unique pair of central words v\, V2 G {0, 1}* such that v = fiOlt^ = V2WV1 where 
h '■= \v\\ + 2 and £2 '■= |?>2| + 2 are coprime periods of v, and min{^i, £2} is the minimal period of v. 
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Consider the prefix of length 2\v\ + 4 of u, namely the prefix vxyz where x,y G {0, 1} and 
\z\ = \v\ + 2. We will now prove each of the cases (a), (b), and (c) of Assertion (hi). 

Case (a): Let us first suppose that u begins with vQlz where \z\ = |u01| and z > uOl. Then it 
is easy to see that 

(w01)°° < u < (vl0)°°. 

Hence, by Lemma l24l we have (j)(0u) = (lvQ)°°. Moreover, v = w p , q where p = \v\i + 1 
and q = \v\ + 2 = l\ + £2- Similarly, if u begins with vlOz where \z\ = \vl0\ and z < vlO, 
then (v01)°° <u< (vlO) 00 , and therefore <^(0tt) = {lv0)°° by LemmaHQ 

Case (b): In this case, either u begins with ?;00 or u begins with vOlz where \z\ = \v01\ and 
z < vOl. 

Since V2W is a prefix of v (which in turn is a prefix of u), we have (t^Ol) 00 < u. Fur- 
thermore, by Lemma l27l the characteristic periodic balanced sequence (t^lO) 00 begins with 
the central word Pal(v0) = v01v2- Thus, if u begins with vOO, then u < (t^lO) 00 . On 
the other hand, if u begins with vOlz where \z\ = \v01\ and z < vOl, then we will show 
that u < (V2W) 00 by considering the prefix of length \v\ + £2 of u, namely v01z2 where 
\ z 2 1 = I ^2 1- We first note that z 2 < t>2 since t> 2 is a prefix of v and z 2 is a prefix of z where 
z and v satisfy z < vOl. Furthermore, z 2 < ^2 (i- e -> z 2 7^ v 2)- Otherwise, if z 2 = t>2, then 
u would begin with the central word Pal(v0) = v0lv2- But then u would have a central 
prefix that is longer than v; a contradiction. Therefore Z2 < V2, and hence u < (t^lO) 00 since 
(U2IO) 00 begins with v01v2 where V2 > Z2, as shown above. 

Case (c): This case is symmetric to Case (b). 

□ 

Example 28. The following examples demonstrate the computation of 4>(0u) for sequences u in 
{0, 1} N that are neither characteristic Sturmian nor periodic and balanced. Where appropriate, the 
longest central prefix of the sequence is highlighted in boldface. 

(1) The following two general facts can easily be deduced from the proofs of parts (i) and (ii) of 
Theorem [26] 

(a) (j)(0u) = (l fc 0)°° for any sequence u having a prefix of the form l k 0v where k > 1, 
\v\ = k, and \v\± < k — 1. 

(b) (j)(0u) = (W k )°° for any sequence u having a prefix of the form k lv where k > 1, 
\v\ = k, \v\o < k — 1. 

(2) By part (iii)(a) of Theorem |26j (j)(0u) = (10100100)°° = (lt^O) 00 for any sequence u 
beginning with 

PaZ(010)011 = tU3 )8 011 = 010010011. 

(3) Let u be the (non-characteristic) Sturmian sequence 

1/ = 10100101001001010010100100101 • • • 
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where / is the (binary) Fibonacci sequence (see Examples l2l and IT6l> . Then the longest central 
prefix of u is = 101 = Pal (10) and u begins with w^OO. Therefore, by part (iii)(b) of 
TheoremHO we have <p(0u) = 0(01/) = (10)°° = (lw 1)2 0)°°. 

(4) By part (iii)(c) of Theorem l26l <p(0u) = (10100) 00 = (lu^O) 00 for any sequence u begin- 
ning with 

PaZ(010)101 = tU3 )8 101 = 010010101. 

(5) By parts (iii)(b) and (iii)(c) of Theorem |26l 4>(x) = (10)°° = (lu^^O) 00 for any sequence x 
beginning with 0011 or 0100. In particular, <p(0t) = (10)°° for the Thue-Morse sequence t, 
which is the fixed point beginning with of the morphism 1 — > 01, 1 1 — > 10: 

t = 0110100110010110- •• 

Also note that <p(t) = (110)°° = (lu^O) 00 . 

(6) Recall that the central word w PtQ of slope p/q G (0, 1) (where gcd(p, q) = 1) has length q — 2 
and contains p — 1 occurrences of 1 (and q — p — 1 occurrences of 0). We observe that if 
p > 2q (i.e., if w Ptq contains more l's than 0's), then w p A begins with 1; otherwise, if p < 2q, 
then w Pi g begins with 0. Hence, we deduce the following general facts from part (iii)(a) of 
Theorem [26] 

(a) If p > 2q, then (p(0u) = (lWp^O) 00 for any sequence u beginning with w Pt(] 100. 

(b) If p < 2q, then (j)(0u) = (lw P:q 0)°° for any sequence u beginning with w P)q 011. 

Remark 29. To determine the longest central prefix of a sequence u G {0, 1} N (which is neither a 
characteristic Sturmian sequence nor a characteristic periodic balanced sequence), possibly the easiest 
way is to check each palindromic prefix of u (in order of increasing length) to see if it is equal to 
Pal(u) for some u G {0, 1}*, until there are no more palindromic prefixes or until one reaches a 
palindromic prefix that is not in the image of Pal. 
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